Analysis of the Difficulties in Chinese Deep Parsing
Abstract
This paper discusses the difficulties in Chinese deep parsing by comparing the accuracy of a Chinese HPSG parser with that of an English HPSG parser and of commonly used Chinese syntactic parsers. The analysis reveals that deep parsing is more challenging for Chinese than for English, owing to the scarcity of syntactic constraints on Chinese verbs, widespread pro-drop, and the wide distribution of ambiguous constructions. Moreover, the inherent ambiguities caused by verbal coordination and relative clauses make the semantic analysis of Chinese more difficult than its syntactic analysis.
Similar resources
Evaluation Report of the third Chinese Parsing Evaluation: CIPS-SIGHAN-ParsEval-2012
This paper gives an overview of the third Chinese parsing evaluation, CIPS-SIGHAN-ParsEval-2012, including its parsing sub-tasks, evaluation metrics, and training and test data. Detailed evaluation results and brief discussions are given to show the difficulties in Chinese syntactic parsing.
Evaluation Report of the fourth Chinese Parsing Evaluation: CIPS-SIGHAN-ParsEval-2014
This paper gives an overview of the fourth Chinese parsing evaluation, CIPS-SIGHAN-ParsEval-2014, including its parsing, evaluation metrics, and training and test data. Detailed evaluation results and brief discussions are given to show the difficulties in Chinese syntactic parsing.
An Algorithm Combining Statistics-based and Rules-based for Chunk Identification of Chinese Sentences
Natural language processing (NLP) is a very hot research domain. One important branch of it is sentence analysis, including Chinese sentence analysis. However, currently, no mature deep analysis theories and techniques are available. An alternative way is to perform shallow parsing on sentences which is very popular in the domain. The chunk identification is a fundamental task for shallow parsi...
Comparative Analysis of the Kurdish Problem in Turkey and the Issue of Chinese in Malaysia within the Context of Nation-State and Ethnic Differences: Advantages and Disadvantages in terms of Turkey
In this study, the ethnic problems which are one of the most significant and perhaps the primary structural difficulties and problems of the nation-state and possible solutions will be suggested. Even if it starts with the general information, the focus of this study would be the subject matter which is known as the Southeastern Problem in Turkey yet it has started to be mentioned as the Kurdis...
Grammatical Relations in Chinese: GB-Ground Extraction and Data-Driven Parsing
This paper is concerned with building linguistic resources and statistical parsers for deep grammatical relation (GR) analysis of Chinese texts. A set of linguistic rules is defined to explore implicit phrase structural information and thus build high-quality GR annotations that are represented as general directed dependency graphs. The reliability of this linguistically-motivated GR extraction...
Publication date: 2011